• Aequitas Flow: Streamlining Fair ML Experimentation

    Updated: 2024-12-31 19:41:44
    : Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Aequitas Flow : Streamlining Fair ML Experimentation Sérgio Jesus , Pedro Saleiro , Inês Oliveira e Silva , Beatriz M . Jorge , Rita P . Ribeiro , João Gama , Pedro Bizarro , Rayid Ghani 25(354 1 7, 2024. Abstract Aequitas Flow is an open-source framework and toolkit for end-to-end Fair Machine Learning ML experimentation , and benchmarking in Python . This package fills integration gaps that exist in other fair ML packages . In addition to the existing audit capabilities in Aequitas , the Aequitas Flow module provides a pipeline for fairness-aware model training , hyperparameter optimization , and

  • DAG-Informed Structure Learning from Multi-Dimensional Point Processes

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us DAG-Informed Structure Learning from Multi-Dimensional Point Processes Chunming Zhang , Muhong Gao , Shengji Jia 25(352 1 56, 2024. Abstract Motivated by inferring causal relationships among neurons using ensemble spike train data , this paper introduces a new technique for learning the structure of a directed acyclic graph DAG within a large network of events , applicable to diverse multi-dimensional temporal point process MuTPP data . At the core of MuTPP lie the conditional intensity functions , for which we construct a generative model parameterized by the graph parameters of a DAG and develop an

  • How Two-Layer Neural Networks Learn, One (Giant) Step at a Time

    Updated: 2024-12-31 19:41:44
    , Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us How Two-Layer Neural Networks Learn , One Giant Step at a Time Yatin Dandi , Florent Krzakala , Bruno Loureiro , Luca Pesce , Ludovic Stephan 25(349 1 65, 2024. Abstract For high-dimensional Gaussian data , we investigate theoretically how the features of a two-layer neural network adapt to the structure of the target function through a few large batch gradient descent steps , leading to an improvement in the approximation capacity with respect to the initialization . First , we compare the influence of batch size to that of multiple but finitely many steps . For a single gradient step , a batch of

  • Boundary constrained Gaussian processes for robust physics-informed machine learning of linear partial differential equations

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Boundary constrained Gaussian processes for robust physics-informed machine learning of linear partial differential equations David Dalton , Alan Lazarus , Hao Gao , Dirk Husmeier 25(272 1 61, 2024. Abstract We introduce a framework for designing boundary constrained Gaussian process BCGP priors for exact enforcement of linear boundary conditions , and apply it to the machine learning of initial boundary value problems involving linear partial differential equations PDEs In contrast to existing work , we illustrate how to design boundary constrained mean and kernel functions for all classes of

  • aeon: a Python Toolkit for Learning from Time Series

    Updated: 2024-12-31 19:41:44
    : Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us aeon : a Python Toolkit for Learning from Time Series Matthew Middlehurst , Ali Ismail-Fawaz , Antoine Guillaume , Christopher Holder , David Guijo-Rubio , Guzal Bulatova , Leonidas Tsaprounis , Lukasz Mentel , Martin Walter , Patrick Schäfer , Anthony Bagnall 25(289 1 10, 2024. Abstract aeon is a unified Python 3 library for all machine learning tasks involving time series . The package contains modules for time series forecasting , classification , extrinsic regression and clustering , as well as a variety of utilities , transformations and distance measures designed for time series data . aeon

  • Geometric Learning with Positively Decomposable Kernels

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Geometric Learning with Positively Decomposable Kernels Nathael Da Costa , Cyrus Mostajeran , Juan-Pablo Ortega , Salem Said 25(326 1 42, 2024. Abstract Kernel methods are powerful tools in machine learning . Classical kernel methods are based on positive definite kernels , which enable learning in reproducing kernel Hilbert spaces RKHS For non-Euclidean data spaces , positive definite kernels are difficult to come by . In this case , we propose the use of reproducing kernel Krein space RKKS based methods , which require only kernels that admit a positive decomposition . We show that one does not

  • Causal Discovery with Generalized Linear Models through Peeling Algorithms

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Causal Discovery with Generalized Linear Models through Peeling Algorithms Minjie Wang , Xiaotong Shen , Wei Pan 25(310 1 49, 2024. Abstract This article presents a novel method for causal discovery with generalized structural equation models suited for analyzing diverse types of outcomes , including discrete , continuous , and mixed data . Causal discovery often faces challenges due to unmeasured confounders that hinder the identification of causal relationships . The proposed approach addresses this issue by developing two peeling algorithms bottom-up and top-down to ascertain causal relationships

  • PROMISE: Preconditioned Stochastic Optimization Methods by Incorporating Scalable Curvature Estimates

    Updated: 2024-12-31 19:41:44
    : Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us PROMISE : Preconditioned Stochastic Optimization Methods by Incorporating Scalable Curvature Estimates Zachary Frangella , Pratik Rathore , Shipu Zhao , Madeleine Udell 25(346 1 57, 2024. Abstract Ill-conditioned problems are ubiquitous in large-scale machine learning : as a data set grows to include more and more features correlated with the labels , the condition number increases . Yet traditional stochastic gradient methods converge slowly on these ill-conditioned problems , even with careful hyperparameter tuning . This paper introduces PROMISE Preconditioned Stochastic Optimization Methods by

  • Commutative Scaling of Width and Depth in Deep Neural Networks

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Commutative Scaling of Width and Depth in Deep Neural Networks Soufiane Hayou 25(299 1 41, 2024. Abstract In this paper , we study the commutativity of infinite width and depth limits in deep neural networks . Our aim is to understand the behavior of neural functions functions that depend on a neural network model as width and depth go to infinity in some sense and eventually identify settings under which commutativity holds , i.e . the neural function tends to the same limit no matter how width and depth limits are taken . In this paper , we formally introduce and define the commutativity framework

  • Robust Principal Component Analysis using Density Power Divergence

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Robust Principal Component Analysis using Density Power Divergence Subhrajyoty Roy , Ayanendranath Basu , Abhik Ghosh 25(324 1 40, 2024. Abstract Principal component analysis PCA is a widely employed statistical tool used primarily for dimensionality reduction . However , it is known to be adversely affected by the presence of outlying observations in the sample , which is quite common . Robust PCA methods using M-estimators have theoretical benefits , but their robustness drop substantially for high dimensional data . On the other end of the spectrum , robust PCA algorithms solving principal

  • Graphical Dirichlet Process for Clustering Non-Exchangeable Grouped Data

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Graphical Dirichlet Process for Clustering Non-Exchangeable Grouped Data Arhit Chakrabarti , Yang Ni , Ellen Ruth A . Morris , Michael L . Salinas , Robert S . Chapkin , Bani K . Mallick 25(323 1 56, 2024. Abstract We consider the problem of clustering grouped data with possibly non-exchangeable groups whose dependencies can be characterized by a known directed acyclic graph . To allow the sharing of clusters among the non-exchangeable groups , we propose a Bayesian nonparametric approach , termed graphical Dirichlet process , that jointly models the dependent group-specific random measures by

  • Compressed and distributed least-squares regression: convergence rates with applications to federated learning

    Updated: 2024-12-31 19:41:44
    : Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Compressed and distributed least-squares regression : convergence rates with applications to federated learning Constantin Philippenko , Aymeric Dieuleveut 25(288 1 80, 2024. Abstract In this paper , we investigate the impact of compression on stochastic gradient algorithms for machine learning , a technique widely used in distributed and federated learning . We underline differences in terms of convergence rates between several unbiased compression operators , that all satisfy the same condition on their variance , thus going beyond the classical worst-case analysis . To do so , we focus on the

  • Contamination-source based K-sample clustering

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Contamination-source based K-sample clustering Xavier Milhaud , Denys Pommeret , Yahia Salhi , Pierre Vandekerkhove 25(287 1 32, 2024. Abstract In this work , we investigate the K$-sample clustering of populations subject to contamination phenomena . A contamination model is a two-component mixture model where one component is known standard behaviour and the second component , modeling a departure from the standard behaviour , is unknown.When K$ populations from such a model are observed we propose a semiparametric clustering methodology to detect which populations are impacted by the same type of

  • Value-Distributional Model-Based Reinforcement Learning

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Value-Distributional Model-Based Reinforcement Learning Carlos E . Luis , Alessandro G . Bottero , Julia Vinogradska , Felix Berkenkamp , Jan Peters 25(298 1 42, 2024. Abstract Quantifying uncertainty about a policy's long-term performance is important to solve sequential decision-making tasks . We study the problem from a model-based Bayesian reinforcement learning perspective , where the goal is to learn the posterior distribution over value functions induced by parameter epistemic uncertainty of the Markov decision process . Previous work restricts the analysis to a few moments of the distribution

  • Wasserstein Proximal Coordinate Gradient Algorithms

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Wasserstein Proximal Coordinate Gradient Algorithms Rentian Yao , Xiaohui Chen , Yun Yang 25(269 1 66, 2024. Abstract Motivated by approximation Bayesian computation using mean-field variational approximation and the computation of equilibrium in multi-species systems with cross-interaction , this paper investigates the composite geodesically convex optimization problem over multiple distributions . The objective functional under consideration is composed of a convex potential energy on a product of Wasserstein spaces and a sum of convex self-interaction and internal energies associated with each

  • Optimistic Search: Change Point Estimation for Large-scale Data via Adaptive Logarithmic Queries

    Updated: 2024-12-31 19:41:44
    : Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Optimistic Search : Change Point Estimation for Large-scale Data via Adaptive Logarithmic Queries Solt Kovács , Housen Li , Lorenz Haubner , Axel Munk , Peter Bühlmann 25(297 1 64, 2024. Abstract Change point estimation is often formulated as a search for the maximum of a gain function describing improved fits when segmenting the data . Searching one change point through all candidates requires O(n evaluations of the gain function for an interval with n$ observations . If each evaluation is computationally demanding e.g . in high-dimensional models this can become infeasible . Instead , we propose

  • Concentration and Moment Inequalities for General Functions of Independent Random Variables with Heavy Tails

    Updated: 2024-12-31 19:41:44
    The concentration of measure phenomenon serves an essential role in statistics and machine learning. This paper gives bounded difference-type concentration and moment inequalities for general functions of independent random variables with heavy tails. A general framework is presented, which can be used to prove inequalities for general functions once the moment inequality for sums of independent random variables is established. We illustrate the power of the framework by showing how it can be used to derive novel concentration and moment inequalities for bounded, Bernstein's moment condition, weak-exponential, and polynomial-moment random variables. Furthermore, we give potential applications of these inequalities to statistical learning theory.

  • Sample Complexity of Variance-Reduced Distributionally Robust Q-Learning

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Sample Complexity of Variance-Reduced Distributionally Robust Q-Learning Shengbo Wang , Nian Si , Jose Blanchet , Zhengyuan Zhou 25(341 1 77, 2024. Abstract Dynamic decision-making under distributional shifts is of fundamental interest in theory and applications of reinforcement learning : The distribution of the environment in which the data is collected can differ from that of the environment in which the model is deployed . This paper presents two novel model-free algorithms , namely the distributionally robust Q-learning and its variance-reduced counterpart , that can effectively learn a robust

  • MLRegTest: A Benchmark for the Machine Learning of Regular Languages

    Updated: 2024-12-31 19:41:44
    : Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us MLRegTest : A Benchmark for the Machine Learning of Regular Languages Sam van der Poel , Dakotah Lambert , Kalina Kostyszyn , Tiantian Gao , Rahul Verma , Derek Andersen , Joanne Chau , Emily Peterson , Cody St . Clair , Paul Fodor , Chihiro Shibata , Jeffrey Heinz 25(283 1 45, 2024. Abstract Synthetic datasets constructed from formal languages allow fine-grained examination of the learning and generalization capabilities of machine learning systems for sequence classification . This article presents a new benchmark for machine learning systems on sequence classification called MLRegTest , which

  • pgmpy: A Python Toolkit for Bayesian Networks

    Updated: 2024-12-31 19:41:44
    Bayesian Networks (BNs) are used in various fields for modeling, prediction, and decision making. pgmpy is a python package that provides a collection of algorithms and tools to work with BNs and related models. It implements algorithms for structure learning, parameter estimation, approximate and exact inference, causal inference, and simulations. These implementations focus on modularity and easy extensibility to allow users to quickly modify/add to existing algorithms, or to implement new algorithms for different use cases. pgmpy is released under the MIT License; the source code is available at: https://github.com/pgmpy/pgmpy, and the documentation at: https://pgmpy.org.

  • Evidence Estimation in Gaussian Graphical Models Using a Telescoping Block Decomposition of the Precision Matrix

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Evidence Estimation in Gaussian Graphical Models Using a Telescoping Block Decomposition of the Precision Matrix Anindya Bhadra , Ksheera Sagar , David Rowe , Sayantan Banerjee , Jyotishka Datta 25(295 1 43, 2024. Abstract Marginal likelihood , also known as model evidence , is a fundamental quantity in Bayesian statistics . It is used for model selection using Bayes factors or for empirical Bayes tuning of prior hyper-parameters . Yet , the calculation of evidence has remained a longstanding open problem in Gaussian graphical models . Currently , the only feasible solutions that exist are for

  • Empirical Design in Reinforcement Learning

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Empirical Design in Reinforcement Learning Andrew Patterson , Samuel Neumann , Martha White , Adam White 25(318 1 63, 2024. Abstract Empirical design in reinforcement learning is no small task . Running good experiments requires attention to detail and at times significant computational resources . While compute resources available per dollar have continued to grow rapidly , so have the scale of typical experiments in reinforcement learning . It is now common to benchmark agents with millions of parameters against dozens of tasks , each using the equivalent of 30 days of experience . The scale of

  • Bayesian Structural Learning with Parametric Marginals for Count Data: An Application to Microbiota Systems

    Updated: 2024-12-31 19:41:44
    : Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Bayesian Structural Learning with Parametric Marginals for Count Data : An Application to Microbiota Systems Veronica Vinciotti , Pariya Behrouzi , Reza Mohammadi 25(339 1 26, 2024. Abstract High dimensional and heterogeneous count data are collected in various applied fields . In this paper , we look closely at high-resolution sequencing data on the microbiome , which have enabled researchers to study the genomes of entire microbial communities . Revealing the underlying interactions between these communities is of vital importance to learn how microbes influence human health . To perform

  • Pure Differential Privacy for Functional Summaries with a Laplace-like Process

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Pure Differential Privacy for Functional Summaries with a Laplace-like Process Haotian Lin , Matthew Reimherr 25(305 1 50, 2024. Abstract Many existing mechanisms for achieving differential privacy DP on infinite-dimensional functional summaries typically involve embedding these functional summaries into finite-dimensional subspaces and applying traditional multivariate DP techniques . These mechanisms generally treat each dimension uniformly and struggle with complex , structured summaries . This work introduces a novel mechanism to achieve pure DP for functional summaries in a separable

  • Sparse Recovery With Multiple Data Streams: An Adaptive Sequential Testing Approach

    Updated: 2024-12-31 19:41:44
    : Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Sparse Recovery With Multiple Data Streams : An Adaptive Sequential Testing Approach Weinan Wang , Bowen Gang , Wenguang Sun 25(304 1 59, 2024. Abstract Multistage design has been utilized across a variety of scientific fields , enabling the adaptive allocation of sensing resources to effectively eliminate null locations and localize signals . We present a decision-theoretic framework for multi-stage adaptive testing that minimizes the total number of measurements while ensuring pre-specified constraints on both the false positive rate FPR and the missed discovery rate MDR Our method , SMART ,

  • On Doubly Robust Inference for Double Machine Learning in Semiparametric Regression

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us On Doubly Robust Inference for Double Machine Learning in Semiparametric Regression Oliver Dukes , Stijn Vansteelandt , David Whitney 25(279 1 46, 2024. Abstract Due to concerns about parametric model misspecification , there is interest in using machine learning to adjust for confounding when evaluating the causal effect of an exposure on an outcome . Unfortunately , exposure effect estimators that rely on machine learning predictions are generally subject to so-called plug-in bias , which can render naive p-values and confidence intervals invalid . Progress has been made via proposals like targeted

  • Inference on High-dimensional Single-index Models with Streaming Data

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Inference on High-dimensional Single-index Models with Streaming Data Dongxiao Han , Jinhan Xie , Jin Liu , Liuquan Sun , Jian Huang , Bei Jiang , Linglong Kong 25(337 1 68, 2024. Abstract Traditional statistical methods are faced with new challenges due to streaming data . The major challenge is the rapidly growing volume and velocity of data , which makes storing such huge data sets in memory impossible . The paper presents an online inference framework for regression parameters in high-dimensional semiparametric single-index models with unknown link functions . The proposed online procedure

  • On Causality in Domain Adaptation and Semi-Supervised Learning: an Information-Theoretic Analysis for Parametric Models

    Updated: 2024-12-31 19:41:44
    : Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us On Causality in Domain Adaptation and Semi-Supervised Learning : an Information-Theoretic Analysis for Parametric Models Xuetong Wu , Mingming Gong , Jonathan H . Manton , Uwe Aickelin , Jingge Zhu 25(261 1 57, 2024. Abstract Recent advancements in unsupervised domain adaptation UDA and semi-supervised learning SSL particularly incorporating causality , have led to significant methodological improvements in these learning problems . However , a formal theory that explains the role of causality in the generalization performance of UDA SSL is still lacking . In this paper , we consider the UDA SSL

  • Learning and scoring Gaussian latent variable causal models with unknown additive interventions

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Learning and scoring Gaussian latent variable causal models with unknown additive interventions Armeen Taeb , Juan L . Gamella , Christina Heinze-Deml , Peter Bühlmann 25(293 1 68, 2024. Abstract With observational data alone , causal structure learning is a challenging problem . The task becomes easier when having access to data collected from perturbations of the underlying system , even when the nature of these is unknown . Existing methods either do not allow for the presence of latent variables or assume that these remain unperturbed . However , these assumptions are hard to justify if the

  • Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning Luofeng Liao , Zuyue Fu , Zhuoran Yang , Yixin Wang , Dingli Ma , Mladen Kolar , Zhaoran Wang 25(303 1 56, 2024. Abstract In offline reinforcement learning RL an optimal policy is learned solely from a priori collected observational data . However , in observational data , actions are often confounded by unobserved variables . Instrumental variables IVs in the context of RL , are the variables whose influence on the state variables is all mediated by the action . When a valid instrument is present , we can recover the

  • A Statistical Experimental Design Method for Constructing Deterministic Sensing Matrices for Compressed Sensing

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us A Statistical Experimental Design Method for Constructing Deterministic Sensing Matrices for Compressed Sensing Youran Qi , Xu He , Tzu-Hsiang Hung , Peter Chien 25(277 1 28, 2024. Abstract Compressed sensing is a signal processing technique used to efficiently acquire and reconstruct signals across various fields , including science , engineering , and business . A critical research challenge in compressed sensing is constructing a sensing matrix with desirable reconstruction properties . For optimal performance , the reconstruction process requires the sensing matrix to have low coherence . Several

  • Identifying Causal Effects using Instrumental Time Series: Nuisance IV and Correcting for the Past

    Updated: 2024-12-31 19:41:44
    : Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Identifying Causal Effects using Instrumental Time Series : Nuisance IV and Correcting for the Past Nikolaj Thams , Rikke Søndergaard , Sebastian Weichwald , Jonas Peters 25(302 1 51, 2024. Abstract Instrumental variable IV regression relies on instruments to infer causal effects from observational data with unobserved confounding . We consider IV regression in time series models , such as vector auto-regressive VAR processes . Direct applications of i.i.d . techniques are generally inconsistent as they do not correctly adjust for dependencies in the past . In this paper , we outline the difficulties

  • Functional optimal transport: regularized map estimation and domain adaptation for functional data

    Updated: 2024-12-31 19:41:44
    : Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Functional optimal transport : regularized map estimation and domain adaptation for functional data Jiacheng Zhu , Aritra Guha , Dat Do , Mengdi Xu , XuanLong Nguyen , Ding Zhao 25(276 1 49, 2024. Abstract We introduce a formulation of regularized optimal transport problem for distributions on function spaces , where the stochastic map between functional domains can be approximated in terms of an infinite-dimensional Hilbert-Schmidt operator mapping a Hilbert space of functions to another . For numerous machine learning applications , data can be naturally viewed as samples drawn from spaces of

  • Studying the Interplay between Information Loss and Operation Loss in Representations for Classification

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Studying the Interplay between Information Loss and Operation Loss in Representations for Classification Jorge F . Silva , Felipe Tobar , Mario Vicuña , Felipe Cordova 25(291 1 71, 2024. Abstract Information-theoretic measures have been widely adopted for machine learning ML feature design . Inspired by this , we look at the relationship between information loss in the Shannon sense and the operation loss in the minimum probability of error MPE sense when considering a family of lossy representations . Our first result offers a lower bound on a weak form of information loss as a function of its

  • Stable and Consistent Density-Based Clustering via Multiparameter Persistence

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Stable and Consistent Density-Based Clustering via Multiparameter Persistence Alexander Rolle , Luis Scoccola 25(258 1 74, 2024. Abstract We consider the degree-Rips construction from topological data analysis , which provides a density-sensitive , multiparameter hierarchical clustering algorithm . We analyze its stability to perturbations of the input data using the correspondence-interleaving distance , a metric for hierarchical clusterings that we introduce . Taking certain one-parameter slices of degree-Rips recovers well-known methods for density-based clustering , but we show that these methods

  • Desiderata for Representation Learning: A Causal Perspective

    Updated: 2024-12-31 19:41:44
    : Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Desiderata for Representation Learning : A Causal Perspective Yixin Wang , Michael I . Jordan 25(275 1 65, 2024. Abstract Representation learning constructs low-dimensional representations to summarize essential features of high-dimensional data . This learning problem is often approached by describing various desiderata associated with learned representations e.g . that they be non-spurious , efficient , or disentangled . It can be challenging , however , to turn these intuitive desiderata into formal criteria that can be measured and enhanced based on observed data . In this paper , we take a

  • Faster Randomized Methods for Orthogonality Constrained Problems

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Faster Randomized Methods for Orthogonality Constrained Problems Boris Shustin , Haim Avron 25(257 1 59, 2024. Abstract Recent literature has advocated the use of randomized methods foraccelerating the solution of various matrix problems arising inmachine learning and data science . One popular strategy for leveraging randomization in numerical linear algebra is to use it as a way to reduce problem size . However , methods based on this strategy lack sufficient accuracy for some applications . Randomized preconditioning is another approach for leveraging randomization in numerical linear algebra ,

  • ENNS: Variable Selection, Regression, Classification, and Deep Neural Network for High-Dimensional Data

    Updated: 2024-12-31 19:41:44
    : , , , Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us ENNS : Variable Selection , Regression , Classification , and Deep Neural Network for High-Dimensional Data Kaixu Yang , Arkaprabha Ganguli , Tapabrata Maiti 25(335 1 45, 2024. Abstract High-dimensional , low-sample-size HDLSS data have been attracting people's attention for a long time . Many studies have proposed different approaches to dealing with this situation , among which variable selection is a significant idea . However , neural networks have been used to model complicated relationships . This paper discusses current variable selection techniques with neural networks . We showed

  • Estimation of Sparse Gaussian Graphical Models with Hidden Clustering Structure

    Updated: 2024-12-31 19:41:44
    Home Page Papers Submissions News Editorial Board Special Issues Open Source Software Proceedings PMLR Data DMLR Transactions TMLR Search Statistics Login Frequently Asked Questions Contact Us Estimation of Sparse Gaussian Graphical Models with Hidden Clustering Structure Meixia Lin , Defeng Sun , Kim-Chuan Toh , Chengjing Wang 25(256 1 36, 2024. Abstract Estimation of Gaussian graphical models is important in natural science when modeling the statistical relationships between variables in the form of a graph . The sparsity and clustering structure of the concentration matrix is enforced to reduce model complexity and describe inherent regularities . We propose a model to estimate the sparse Gaussian graphical models with hidden clustering structure , which also allows additional linear

  • Best Data Visualization Projects of 2024

    Updated: 2024-12-30 09:30:29
    Many datasets were analyzed and many charts were made this year. These are my favorite data visualization projects from 2024.Tags: best-of

  • Year of tornadoes

    Updated: 2024-12-26 10:38:28
    For the New York Times, Marco Hernandez visualized all the tornadoes in 2024,…Tags: New York Times, tornado

  • Data Advent Calendar

    Updated: 2024-12-23 09:02:59
    Jordan Cunliffe uses embroidery to explore and visualize data. This year, headed towards…Tags: Advent, embroidery, Jordan Cunliffe

  • Map of walkable neighborhoods

    Updated: 2024-12-20 11:41:50
    Researchers at Sony Computer Science Laboratories in Rome estimated the time it takes…Tags: city, local, walking

  • Scott Klein: tales of 2024

    Updated: 2024-12-17 06:14:16
    Latest episode >> It was a year in which a Presidential candidate said a chart saved his life, generative AI changed journalism and we worried about whether the field is getting less innovative. ⁠Scott Klein⁠ is an entrepreneur in residence at Newspack, helping publishers on WordPress and other platforms do great election coverage by building … Continue reading →

  • New Interesting Data Visualization Examples to Check Out — DataViz Weekly

    Updated: 2024-12-13 16:07:38
    Curious about what’s new and interesting in the world of data visualization? DataViz Weekly brings you a selection of projects with charts and maps that have caught our attention lately. Here’s what’s lined up this time: Threats of light and noise pollution to aquatic ecosystems — AquaPLAN 15-minute accessibility in urban areas — The Washington […] The post New Interesting Data Visualization Examples to Check Out — DataViz Weekly appeared first on AnyChart News.

  • Recent Data Visualization Projects Worth Exploring — DataViz Weekly

    Updated: 2024-12-06 16:31:50
    Sales : 1 888 845-1211 USA or 44 20 7193 9444 Europe customer login Toggle navigation Products AnyChart AnyStock AnyMap AnyGantt Mobile Qlik Extension Features Resources Business Solutions Technical Integrations Chartopedia Tutorials Support Company About Us Customers Success Stories More Testimonials News Download Buy Now Search News » Data Visualization Weekly » Recent Data Visualization Projects Worth Exploring — DataViz Weekly Recent Data Visualization Projects Worth Exploring — DataViz Weekly December 6th , 2024 by AnyChart Team Data visualization makes complex information accessible and insightful , serving as a valuable tool for both analysis and communication . This edition of DataViz Weekly features four recent projects that showcase its application across diverse . topics

  • David Spiegelhalter: The Art of Uncertainty

    Updated: 2024-12-04 02:03:55
    New podcast episode details: David Spiegelhalter⁠ is Emeritus Professor of Statistics in the Statistical Laboratory, University of Cambridge and author of new book ⁠The Art of Uncertainty⁠ We live in chaotic times and David makes that world a little clearer with humour and clarity in this special interview with Alberto and Simon. ⁠The⁠ ⁠⁠⁠⁠⁠⁠⁠music this … Continue reading →

Current Feed Items | Previous Months Items

Nov 2024 | Oct 2024 | Sep 2024 | Aug 2024 | Jul 2024 | Jun 2024